Efficient high-dimensional entanglement imaging with a compressive sensing, 

double-pixel camera 



Gregory A. Howlanc|2] and John C. Howell 

Department of Physics and Astronomy, University of Rochester, Rochester, New York, 14627, USA 

(Dated: December 24, 2012) 

We implement a double-pixel, compressive sensing camera to efficiently characterize, at high 
resolution, the spatially entangled fields produced by spontaneous parametric downconversion. This 
technique leverages sparsity in spatial correlations between entangled photons to improve acquisition 
times over raster-scanning by a scaling factor up to n^/log(n) for n-dimensional images. We image 
at resolutions up to 1024 dimensions per detector and demonstrate a channel capacity of 8.4 bits 
per photon. By comparing the entangled photons' classical mutual information in conjugate bases, 
we violate an entropic Einstein- Podolsky- Rosen separability criterion for all measured resolutions. 
More broadly, our result indicates compressive sensing can be especially effective for higher-order 
measurements on correlated systems. 
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INTRODUCTION 

Spatially entangled biphotons, such as those generated 
by spontaneous parametric downconversion (SPDC), ex- 
hibit strong Einstein-Podolsky-Rosen (EPR) type cor- 
relations [T in the transverse position and transverse 
momentum degrees of freedom [2 . Because these vari- 
ables are continuous, the entanglement can be very high- 
dimensional with a typical Schmidt number greatly ex- 
ceeding 1000 [3 . This provides high information density 
which can be leveraged to increase channel capacity and 
security for quantum key distribution [T-'G^ and dense 
coding [7j 8 . Other applications include ghost imaging 
[9l[T0], quantum computing [11 , and quantum telepor- 
tation [11]. 

Experimentally characterizing the SPDC state is un- 
fortunately difficult due to weak sources and low res- 
olution detectors. Spatial entanglement is tradition- 
ally imaged by jointly raster-scanning photon-counting 
avalanche photodiodes (APDs) to measure spatial cor- 
relations. This scales extremely poorly with increasing 
detector resolution. With a biphoton flux of 4, 000 co- 
incident detections per second, it would take 55 days to 
jointly scan a 24 x 24 pixel region for a signal-to-noise 
ratio (SNR) of 10. For 32 x 32 pixels, it would take 310 
days (see Eq. [9|. 

Other approaches have been tried with mixed success. 
Intensifled CCD cameras can measure the Schmidt num- 
ber [13] , but do not detect single-photon correlations ren- 
dering them ineffective for most quantum applications. 
Arrays of photon counting APDs could replace CCDs, 
but they are currently low resolution, noisy and resource 
intensive; especially since each pixel pair must be indi- 
vidually correlated [MHTBj . A recent, promising result 
averages intensity correlations over many images from 
a single-photon sensitive electron- multiplying CCD, re- 
porting 2500 modes p!7]. This technique is limited to a 



30 ms exposure time (APDs are sub-ns) and is noisier 
than using APDs because it does not isolate individual 
coincident detections. 

In ref [18] , Dixon et. al reduce the number of measure- 
ments required for a raster-scan by only measuring in an 
area of interest where correlations are expected, report- 
ing a channel capacity of 7 bits per photon. While not a 
true full-fleld measurement, they highlight a critical fea- 
ture of the SPDC fleld. In both position and momentum 
representations, the distribution of correlations between 
pairs of detector pixels is very sparse despite dense (not 
sparse) single-particle distributions. Applying ideas from 
the fleld of compressive sensing, we exploit prior knowl- 
edge of this sparsity to beat the "curse of dimensional- 
ity" [19 and efflciently characterize the full biphoton fleld 
without raster scanning. 

In this article, we implement a compressive sensing, 
photon-counting double-pixel camera that efflciently im- 
ages single-photon SPDC correlations in the near- and 
far-fleld at resolutions up to 32 x 32 = 1024 dimensions 
per detector. At 32x32 resolution, the measurement time 
is reduced from 310 days for raster scanning to around 8 
hours. We perform an entropic characterization showing 
channel capacities of up to 8.4 bits per photon, equiv- 
alent to 337 independent, identically distributed modes. 
Sums of channel capacities in conjugate bases violate a 
EPR steering bound ^ by up to 6.6 bits. 



THEORY 
Compressive Sensing 

Compressive sensing is a technique that employs 
optimization to measure a sparsely represented A^- 
dimensional signal from M < N incoherent measure- 
ments [2TH24] . The approach is so-named because the 
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signal is effectively compressed during measurement. 
Though sparsity is assumed, it is not known prior to 
measurement which elements contain appreciable ampli- 
tude. Compressive sensing must determine both which 
elements are significant and find their values. 

To detect a sparsely represented A/"- dimensional signal 
vector X, we measure a series of M < values Y by 
multiplying X by an M x sensing matrix A such that 

r = Ax + r, (1) 

where F is a noise vector. 

Because M < this system is undetermined; a given 
Y does not specify a unique X. The correct X is recov- 
ered by minimizing a regularized least squares objective 
function 

• ^'\Y-AX\\l + Tg{X), (2) 
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where for example \\^\\2 is the £2 norm of Q and r is a 
scaling constant. The function g{X) is a regularization 
promoting sparsity. Common g{X) include X's £1 norm, 
assuming the signal is sparse, and X's total variation, 
assuming the signal's gradient is sparse [25]. A must be 
incoherent with the basis of interest, with the surprising 
and non-intuitive result that a random, binary sensing 
matrix works well. Given sufficiently large M, the recov- 
ered X approaches the exact signal with high probability 
[26 . For a /c— sparse signal, the required M scales as 
Ma k\og{N/k). 

Incoherent, random sampling is particularly beneficial 
for low-light measurements as each measurement receives 
on average half the total photon flux <^>/2, as apposed to 
<l>/n for a raster scan. Compressive sensing is now begin- 
ning to be used for quantum applications such as state 
tomography [27 . Shabani et. al, for example, perform 
a tomography on a two qubit photonic gate for polariza- 
tion entangled photons [28]. CS has also been used to 
with spatially correlated light for ghost imaging [29l |30] . 
It is important to note that for ghost imaging, CS is not 
required to recover the full two-particle probability dis- 
tribution as in entanglement characterization. 

The quintessential compressed sensing example is the 
single-pixel camera [3TJ|32]. An object is imaged onto a 
Digital Micromirror device (DMD), a 2D binary array of 
individually-addressable mirrors that reflect light either 
to a single detector or a dump. Rows of the sensing ma- 
trix A consist of random, binary patterns placed sequen- 
tially on the DMD. For an A/"- dimensional image, mini- 
mizing Eq. [2] recovers images using as few as M = 0.02 A/" 
measurements. 



Compressive Sensing for Measuring Correlations 

The single-pixel camera concept naturally adapts to 
imaging correlations by adding a second detector. Con- 
sider placing separate DMDs in the near-field or far-field 



of the SPDC signal and idler modes, where "on" pixels 
are redirected to photon counting modules. The signal 
of interest is 



Pa,{u,v)= J dXs J dXi\f{Xs,£i)f (3) 
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{u, v) = j dks j dh\f{ks, fcl)|^ (4) 
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where p{u^ v) represents the probability of a coincident 
detection between the u^^ pixel on the signal DMD and 
v^^ pixel on the idler DMD. The functions f{xs^Xi) and 
f{ks,ki) are approximate position and momentum wave- 
functions for the biphoton 



i\){xs,Xi) =A/'exp ( \ ' I exp 



{x s -\- X{) 

ip{ks,ki) ={Aapacfj\f eyiY>{-al{ks - hf) 
xey.^{-Aal{ks + kif). 



(5) 



Subscripts s and i refer to signal and idler photons respec- 
tively, <jp and <Jc are the pump and correlation widths, 
and A/" is a normalizing constant. X of Eq. [2] is simply a 
one-dimensional reshaping of Px oi" Pk- 

Like the single-pixel camera, a series of random pat- 
terns are placed on the DMDs to form rows of A. For each 
pair of patterns, correlations between the signal and idler 
photons form the measurement vector Y . Minimization 
of Eq. [2] recovers p{u^ v). 

While a fully random A is preferred, the DMDs only 
act on their respective signal or idler subspace, which pre- 
vents arbitrary A. Rows of A are therefore outer products 
of rows of single-particle sensing sensing matrices a and 
b 



A: 



a2 



b2 



(6) 



where rows of a represent random patterns placed on the 
signal DMD and rows of b represent random patterns 
placed on the idler DMD. To make signal and idler pho- 
tons distinguishable, a and b are not the same. The 
validity of Kronecker-type sensing matrices has been es- 
tablished and is of current interest in the CS community 
as attention shifts to higher dimensional signals [f33l [34] . 
The measurement vector Y is obtained by counting co- 
incident detections for the series of DMD configurations 
given by A. 

A variety of reconstruction algorithms exist for Eq. [2| 
with their computational complexity dominated by re- 
peatedly calculating AX and A^Y [35 . This is espe- 
cially unwieldy for correlation measurements as the size 
of A is M X for n— pixel DMDs. Using properties 
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for Kronecker products [36] , these can be more efficiently 
computed by 

AX = diag(bsq(X)a^) (7) 
A^Y = vec(b^ od (y)a), (8) 

where sq and vec reshape a vector to a square matrix 
and vice- versa; diag forms a vector from the diagonal el- 
ements of a square matrix; and od forms a square matrix 
placing the operand vector on its diagonal. 

Comparison to Raster Scanning 

The compressive approach finds the joint probability 
distribution orders of magnitude faster than raster scan- 
ning through two key improvements. The first is simply 
the reduction in the number of measurements. To jointly 
raster scan an n-pixel space requires measurements. 
For a compressive measurement, sparsity is approxi- 
mately n with dimensionality n^, so only M cx nlog(n) 
measurements are required. In practice, we found excel- 
lent results when M was only three percent of n^. 

The second advantage of compressive measurements 
is that they more efficiently use available flux. For the 
raster scan, the total flux is distributed over at best n 
pairs of pixels in the case of perfect correlations. Con- 
versely, the average flux per incoherent compressive mea- 
surement is independent of n, with each measurement 
receiving on average 1/4 the total flux. To maintain 
constant SNR (photons/measurement) with increasing n, 
total measurement time therefore scales as for raster 
scanning. Given a photon flux of <l> photons per second, 
the measurement time for a desired SNR is 

2, _ ^^SNR^ 

t — 71 tmeas — <1> ' 

where tmeas is the time per measurement. 

For incoherent, compressive measurements, acquisition 
time scales as nlog(n). The compressive improvement 
therefore scales as n^/log(n). For n = 32 x 32 = 1024, 
this is of order 10^. 

This scaling factor somewhat optimistically assumes 
the reconstruction process yields an accurate result de- 
spite a noisy signal. Unfortunately, propagation of un- 
certainty through the reconstruction process remains a 
difficult problem, especially for non-ideal, real world sys- 
tems [37] • There has been much recent theoretical work 
on the topic for Gaussian j38ti4Q] and Poissonian noise 
HU |42] . These results tend to require ideal sensing ma- 
trices or more complicated formulations to give provable 
performance bounds. As such, their findings are difficult 
to directly, quantitatively apply to experiment. How- 
ever, they do reveal pertinent features that indicate CS 
can perform extremely well in the presence of noise. 



A well known characteristic of CS is a rapid phase 
change from poor to good quality reconstructions [43]. 
This phase change is often discussed as a function of in- 
creasing m, with the boundary m (x k \og{n/k). A similar 
phase transition occurs as a function of the noise level; 
in our case, this is the number of photons per measure- 
ment. For some cases, these two are even linked [38 . A 
practical compressive measurement simply requires large 
enough m and photon flux ^ to be in the space of good re- 
constructions. Fortunately, simply obtaining a recogniz- 
able reconstruction generally indicates the measurement 
conditions exceed this threshold. 

Unlike a direct measurement, the information obtained 
by a series of y compressive measurements is contained in 
their deviation from the average value y. In the presence 
of noise, these deviations must exceed the noise level. As- 
suming Poissonian shot noise, good reconstructions re- 
quire std{y) > where std{y) is the standard de- 
viation of the measurement vector and /3 is a positive 
constant greater than one. 

The particular algorithm chosen to solve Eq. [2] also 
plays a role in the reconstruction" s accuracy. These of- 
ten have provable performance on ideal signals, but de- 
grade when confronted with noisy or otherwise non-ideal 
conditions. In these circumstances, they have various 
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FIG. 1. Experimental Setup. Photons generated via SPDC 
pass through a narrow-band filter and are split into signal 
and idler modes by a 50/50 beamsplitter. For position cor- 
relations, lenses fl — 125 mm and /2 — 500 mm form a 
4/ imaging system with the crystal and DMDs placed in the 
object and image planes respectively. For momentum correla- 
tions, fl is removed and the DMD is placed in the focal plane 
of /2 = 88.3 mm. Photons striking DMD "on" pixels are di- 
rected to large area, single-photon counting modules. Photon 
arrivals are then correlated by a coincidence circuit. 
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FIG. 2. 16 X 16 Pixel Simulation. The ideal object is given in (a). The object was incoherently sampled with m — 2500 random 
binary patterns. Poissonian noise corresponding to 5000 photons in the field (~ 1250 detected) per measurement was added 
to the measurement vector. The reconstruction is shown in (b), with MSB 5 x 10~^. Plots (c) and (d) integrate along the 
anti-diagonal to show that the reconstruction recovers the correlation width ac < 1 pixel with negligible error. 



strengths, including speed, accuracy, and sensitivity to 
user selected parameters such as r in Eq. |2l For more 
information on common reconstruction algorithms, see 
Refs [HIMlBlllig. 

In practice, the best way to determine accuracy for 
a particular signal, sensing matrix, and reconstruction 
approach remains repeated simulations or experiments. 
For our system, we reduce a n = 32 x 32 measurement 
from a 310 day raster scan (SNR of 10) to an 8 hour 
compressive acquisition, a thousand- fold improvement. 



EXPERIMENT 

The experimental apparatus is given in Fig. [l] Light 
from a 2.8 mW, 325 nm HeCd laser was directed to a 
1 mm long BiBO crystal oriented for type I, collinear 
SPDC. The generated daughter photons passed through 
a 650/13 nm narrowband filter before separating into sig- 
nal and idler modes at a 50/50 beamsplitter. To measure 
position-position correlations, lenses fl = 125 mm and 



/2 = 500 mm imaged the crystal onto signal and idler 
mode DMDs. For momentum-momentum correlations, 
fl was removed and the DMDs were placed in the focal 
plane of /2 = 88.3 mm. DMD "on" pixels reflected light 
to large area, single photon counting modules (SPCM) 
connected to a correlating circuit. 

To measure p{u^ v), a series of M random patterns were 
placed on the DMDs to form the sensing matrix A. For 
each set of patterns, joint detections were counted for ac- 
quisition times taq for a total measurement time t = Mtaq 
to make up the measurement vector y. The joint distri- 
bution p{u^ v) was reconstructed using a Gradient Pro- 
jection solver for Eq. [2] with £i regularization, commonly 
referred to as Basis Pursuit Denoising [44 . 

We measured at dimensions of = 256^, N = 576^, 
and N = 1024^ corresponding to DMD resolutions of 
16 X 16, 24 X 24, and 32 x 32 pixels. The associated 
measurement numbers M were 2500, 10, 000, and 30, 000 
so that M is only about 0.03 A/". Acquisition times were 
1 second for position measurements and 1.5 seconds for 
momentum measurements to average 1000 coincident de- 
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FIG. 3. Simulated Mean Squared Error (MSE) versus Photon 
Flux for n = 256 and m = 2500. The phase change behavior 
versus photon number can be clearly seen. The experiment 
used 5000 total (1250 detected) photons per measurement to 
comfortably exceed the phase change. The MSE approaches 
a value 5 x 10~^ corresponding to an SNR of roughly 17. 



tections per DMD configuration in all cases. Addition- 
ally, we performed representative simulations at 16 x 16 
and 24 x 24 resolutions. 



RESULTS 
Joint Probability Distribution 

A simulation for measuring position-position correla- 
tions at 16 X 16 DMD resolution is given in Fig. [2] The 
object in |2|a) is the correlation function of Eq. |4] The 
simulation used m = 2500 measurements and a photon 
flux of ^ = 5000 photons/measurement multiplied by the 
ideal p{u, v), conditions representative of the 1 second ex- 
perimental acquisitions. Note that this is the total signal 
strength before interacting with the sensing matrix; the 
mean value of the measurement vector is <l>/4 = 1250 de- 
tected photons. Values of the measurement vector were 
Poissonian distributed to simulate the effect of shot noise. 

Fig. |4jb) gives the reconstructed correlation function 
p{u, v) between signal and idler DMD pixels. The sharply 
defined diagonal line shows the expected positive corre- 
lations between the two DMDs. DMD pixels are listed in 
column-major order. The mean squared error (MSE) for 
the reconstruction was 5 x 10~^. The two-dimensional 
signal marginal distribution is inset, which provides an 
image of the signal beam. Figs. |2|c) and[2|d) sum the re- 
sult along the anti-diagonal to show the correlation width 
(Jc. Qualitatively, the reconstruction closely resembles 
the original object, faltering only near the edges of the 
distribution where the signal falls beneath a noise floor. 



The reconstruction recovers dc < 1 pixel with negligible 
error. 

To demonstrate the reconstruction accuracy, simula- 
tions were performed for increasing photon flux ^ with 
DMD resolution 16 x 16 and m = 2500. The MSE versus 
^ is given in Fig. [Sj Reconstructions were normalized 
to the incident flux $ for comparison to the ideal sig- 
nal. The result shows the rapid phase change from poor 
to excellent reconstructions with a MSE converging to 
5 X 10~^ beyond the phase change. 

The MSE can be used to roughly estimate the signal- 
to- noise ratio for a particular measurement of an average, 
non-zero element. Assuming perfect pixel correlations 
and uniform marginal distributions, the energy in the 
signal is distributed over 1/n elements. The signal-to- 
noise ratio is then 1/nVMSE. For n = 256 pixels and 
MSE = 5 X 10~^, this yields an approximate SNR of 
17. For comparison, using Eq. [9) a raster scan would 
require about four days to achieve a SNR of only 10. 
The simulated CS acquisition time was 42 minutes for 
2500, 1 second measurements. 

Sample experimental reconstructions for position- 
position and momentum-momentum correlations at 16 x 
16 pixel DMD resolution are given in Figj4] As in the 
simulations, the position-position result shows a well de- 
fined diagonal line indicating positive pixel correlations. 
Conversely, the momentum-momentum result shows an 
anti-diagonal line showing the expected anti-correlations. 
Figs, gc) and gd) sum the results along the anti- 
diagonal (position-position) and diagonal (momentum- 
momentum) to reveal an effective correlation width <Jce 
of a single pixel. Our detection scheme is therefore as 
accurate as possible at this resolution and our channel 
capacity remains detector limited. 



Mutual Information in the Channel 

Once p{u^ v) is recovered, the channel capacity is given 
by the classical mutual information shared between signal 
and idler DMD pixels; 

/ = - ^p{u) \ogp{u) - ^p{v) \ogp{v) 

u V 

+ ^p(u, v) \ogp(u, v), (10) 
where for example, 

P{u) = ^p{u,v) (11) 

V 

is the signal particle's marginal probability distribu- 
tion [18^. This entropic analysis is solely measurement 
based and does not require reconstructing a wavefunc- 
tion or density matrix, a challenging task even for low- 
dimensional systems 
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FIG. 4. Sample 16 x 16 Experimental Reconstructions. [4|a) and[4|b) give the joint probability distribution for position-position 
and momentum-momentum correlations where DMD pixels are listed in column-wise order. 2D marginal distributions for the 
signal photon are inset. |4jc) and [4|d) show correlation widths of only 1 pixel by summing over the signal + idler (c) and 
signal — idler (d) axes. Only 2500 (3% of raster-scanning) measurements were needed with a total acquisition time of about 
40 minutes. 
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FIG. 5. Mutual Information Versus Thresholding. The mu- 
tual information for reconstruction values above a thresholded 
percentage of the maximum is given for 100, n — 256 pixel 
simulations with m — 2500 measurements and $ = 5000 pho- 
tons per measurement. The red, constant line gives the true 
mutual information for the simulated object. The black points 
give the n = 256 far field experimental data for comparison. 



To estimate the uncertainty in the mutual information 
from shot noise and the reconstruction process, we per- 
formed 100 simulations at n = 256 pixel resolution and 
31 simulations at n = 576 pixel resolution. Simulations 
were not performed at n = 1024 pixel resolution due to 
available computer time. In addition to the results from 
the raw reconstruction, thresholding was performed to 
provide noise reduction, where all values in the recov- 
ered p{u^ v) below a percentage of the maximum value 
are forced to zero. The simulated mutual information 
versus thresholding percentage is given in Fig. [5] for the 
n = 256 pixel simulations exemplified by Fig. [2j Error- 
bars enclose one standard deviation from repeated simu- 
lations. 

As the threshold increases from zero, the mutual in- 
formation rises as a weak, uncorrelated noise floor is re- 
moved. An optimal threshold is quickly reached, beyond 
which the threshold removes more signal than noise, re- 
ducing the mutual information. Note that the recon- 
structed mutual information is systematically lower than 
the actual mutual information in the ideal object. This is 
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FIG. 6. Mutual information between signal and idler photons for position-position and momentum- momentum representations 
are presented as a function of detector resolution. Three levels of thresholding are shown, as well as a fit to Eq. [5] Dashed 
lines are to guide the eye. Error bars enclose two standard deviations from the expected uncertainty from simulations (not 
performed for n = 1024). The solid curve represents the maximum possible value for a particular detector resolution given 
perfect correlations and uniform marginals. 



due to remaining noise and difficulty in recovering parts 
of the signal towards the tail of the distribution. 

The n = 256 far field experimental result is included 
for comparison to the simulation. The experiment closely 
matches the simulation both for no thresholding and be- 
yond its optimal threshold, but is smaller in the inter- 
mediate region. This is likely due to experimental er- 
rors not included in the simulation. These include slight 
pixel misalignment between signal and idler DMDs, op- 
tical aberrations, detector dark noise, stray light, power 
fluctuations in the laser, and temperature stability of the 
nonlinear crystal. Fig. [5] indicates that these experimen- 
tal difficulties appear to increase the uncorrelated noise 
floor rather than significantly affect the correlated part 
of the reconstruction. 

Although thresholding is a simple post-processing tech- 
nique, it is applicable to how the entangled pixels might 
be used for communication. If a pair of entangled pixels 
has a correlated amplitude near or below the background 
noise, it will be difficult to use that particular pair for 
communication. A communication scheme would likely 
only consider pixel pairs above a certain threshold to be 
useful. This is related to the technique in photonic quan- 
tum information of subtracting background noise from a 
measured signal. In CS, it is also common to perform 
post-processing or secondary optimization after maximiz- 



ing sparsity, such as the debiasing routine used in Ref. 

The experimental channel capacity versus DMD res- 
olution for both position-position and momentum- 
momentum is given in Fig. [6] for several levels of thresh- 
olding. The optimal threshold is that which maximizes 
the mutual information. At 256 and 576 pixel resolu- 
tions, optimal thresholds of 20% and 30% were used 
for position-position and momentum-momentum distri- 
butions respectively. At 1024 pixel resolution, noise was 
more significant, so the optimal thresholds increased to 
30% and 40%. Error bars on n = 256 and n = 576 pix- 
els measurements represent the expected effect of shot 
noise and reconstruction uncertainty derived from simu- 
lation. These have been conservatively set to include two 
standard deviations from the simulated result. 

The joint probability distribution was also fit to the 
double- Gaussian wavefunction (Eq. [s]) to find effective 
widths ace and ape. When ap ac, the mutual informa- 
tion between particles for Eq. [5] is the logarithm of the 
Federov ratio [49^ 

log (I) . (12) 

where the ratio is squared for two dimensions. While 
this technically applies to the continuous wavefunction. 
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FIG. 7. The sum of position-position and momentum- 
momentum mutual information is presented as function of 
detector resolution to demonstrate violation of an EPR steer- 



ing inequality (Eq. 13). The solid line represents the thresh- 
old that must be exceeded to witness EPR steering. Error 
bars enclose two standard deviations from the expected un- 
certainty from simulations (not performed for n = 1024). 
Because simulations systematically underestimate the mutual 
information, values above the bound are convincing. 



and the true cic is smaller than a DMD pixel, Eq. [12] 
still applies to the discritized measurement so long as 
the effective ace ^ cTpe- 

Fitting yielded the largest channel capacities with a 
maximum of 8.4 bits for momentum- momentum correla- 
tions at 1024 pixel resolution, equivalent to 337 indepen- 
dent, identically distributed, entangled modes. 

Given that fitting more accurately characterizes the 
system and gives a larger mutual information, it is rea- 
sonable to question the usefulness the direct mutual in- 
formation computation. However, the two approaches 
suit different purposes. Fitting is useful if one is par- 
ticularly interested in the state itself. However, if one 
intends to use correlated pixels for some other purpose, 
such as communication, the direct calculation is more 
appropriate. This is because the correlated pixels on the 
low intensity, tail of the distribution will be difficult to 
use in practice even if their amplitude can be inferred by 
fitting. 

The solid curve of Fig. [6] gives the maximum possi- 
ble mutual information between two, n-pixel detectors. 
Assuming perfect diagonal or anti-diagonal correlations 
and uniform marginals, this is simply log(n). Because we 
have Gaussian marginals, we do not expect to reach this 
bound, even with ace ^ 1 pixel. By magnifying and us- 



ing only the central part of the field, we could approach 
this upper limit. 



Witnessing Entanglement 

Despite not reconstructing a full density matrix, it is 
still possible to demonstrate non-classical behavior by 
comparing position-position and momentum-momentum 
correlation measurements directly. This has traditionally 
involved fitting the measurements to Eq. [5] and analyzing 
products or sums of conditional variances [5QH52]. 

We recently presented a more inclusive, entropic steer- 
ing inequality for witnessing continuous variable entan- 
glement with discrete measurements [20 , where the sum 
of the classical mutual information between position- 
position and momentum-momentum correlations is clas- 
sically bounded. For our system, all classically correlated 
measurements must satisfy 



I, 



< 2 log 



nd^dk 



(13) 



where dx and d^ are the respective widths of DMD pixels 
in the position and momentum basis. Note that ndxd^ is 
simply the bandwidth product for the DMD area, and is 
independent of n if total area does not change. 

The sum of the classical mutual information in conju- 
gate bases for each detector resolution is given in Fig. [7| 
The solid blue line provides right hand side of Eq. [T3j 
which must be exceeded to witness EPR steering. Error 
bars for the n = 256 and n = 576 cases are derived from 
simulation and include two standard deviations. In all 
cases, we show EPR steering both with optimal thresh- 
olding and fitting to the double Gaussian wavefunction 
(Eq. [5|. Even at 5% thresholding, there is a violation 
for 16 X 16 dimensions. Recall that simulations (Fig. [3| 
systematically under-represented the object mutual in- 
formation relative to measurement uncertainty, so mea- 
surement error is highly unlikely to have over-estimated 
this sum. For the fitted 32 x 32 dimensional result, we 
violate the classical bound by 6.6 bits. 



CONCLUSION 

In this article, we present a compressive sensing, 
double-pixel camera for characterizing the SPCD bipho- 
ton state with photon-counting detectors. This technique 
is very efficient, improving acquisition times over raster- 
scanning by n^/log(n) for n— pixel detectors. We image 
SPDC correlations at up to 1024 dimensions per detec- 
tor and measure a detector-limited mutual information 
of up to 8.4 bits. We also violate an entropic EPR steer- 
ing bound, indicating that these correlations are non- 
classical. More broadly, our results suggest that compres- 
sive sensing can be extremely effective for analyzing cor- 
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relations within large dimensional signals (eg. intensity- 
intensity correlations) . Potential applications range from 
verifying security in spectral correlations for energy-time 
QKD [53| to imaging through scattering media [54] . 
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